Picture for Ruihua Song

Ruihua Song

Unified Synthesis of Compositional Speech and Sound from Free-Form Text Prompts

Add code
May 27, 2026
Viaarxiv icon

Pair-In, Pair-Out: Latent Multi-Token Prediction for Efficient LLMs

Add code
May 26, 2026
Viaarxiv icon

SyncDPO: Enhancing Temporal Synchronization in Video-Audio Joint Generation via Preference Learning

Add code
May 12, 2026
Viaarxiv icon

HuM-Eval: A Coarse-to-Fine Framework for Human-Centric Video Evaluation

Add code
Apr 28, 2026
Viaarxiv icon

Toward Autonomous Long-Horizon Engineering for ML Research

Add code
Apr 14, 2026
Viaarxiv icon

SentiAvatar: Towards Expressive and Interactive Digital Humans

Add code
Apr 03, 2026
Viaarxiv icon

BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing?

Add code
Mar 03, 2026
Viaarxiv icon

MSJoE: Jointly Evolving MLLM and Sampler for Efficient Long-Form Video Understanding

Add code
Feb 26, 2026
Viaarxiv icon

BFS-PO: Best-First Search for Large Reasoning Models

Add code
Feb 16, 2026
Viaarxiv icon

Restoring Exploration after Post-Training: Latent Exploration Decoding for Large Reasoning Models

Add code
Feb 02, 2026
Viaarxiv icon